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We study the data complexity of model-checking for logics with team semantics. For dependence 
and independence logic, we completely characterize the tractability/intractability frontier of data 
complexity of both quantifier-free and quantified formulas. For inclusion logic formulas, we reduce 
the model-checking problem to the satisfiability problem of so-called Dual-Horn propositional for¬ 
mulas. Via this reduction, we give an alternative proof for the recent result showing that the data 
complexity of inclusion logic is in PTIME. 

1 Introduction 

In this article we study the data complexity of model-checking of dependence, independence, and inclu¬ 
sion logic formulas. Independence and inclusion logic iflOl |4|] are variants of dependence logic iflTl that 
extends first-order logic by dependence atoms of the form ={x \,... ,x„) expressing that the value of x„ is 
functionally determined by the values of the variables jq,... ,x n -\. In independence and inclusion logic 
dependence atoms are replaced by independence and inclusion atoms y _L r z and x C y, respectively. 
The meaning of the independence atom is that, with respect to any fixed value of I. the variables y are 
independent of the variables z, whereas the inclusion atom expresses that all the values of x appear also 
as values for y. 

Dependence logic is a new framework for formalizing and studying various notions of dependence 
and independence pervasive in many areas of science. The novelty of dependence logic is in its team 
semantics in which formulas are interpreted using sets of assignments (with a common finite domain 
{xi,..., x n } of variables) instead of single assignments as in first-order logic. Reflecting this, dependence 
logic has higher expressive power than classical logics used for these purposes previously. Dependence, 
inclusion, and independence atoms are intimately connected to the corresponding functional, inclusion, 
and multivalued dependencies studied in database theory, see, e.g., ifTTl . Interestingly, independence 
atoms can also be viewed as a qualitative analogue of the notion of conditional independence in statistics, 
see Q. Furthermore, a variant of dependence logic is in the heart of Inquisitive Semantics which is a 
novel approach in linguistics that analyzes information exchange through communication, see |T). 

Dependence logic and its variants can be used to formalize and study dependence and independence 
notions in various areas. For example, in the foundations of quantum mechanics, there are a range 
of notions of independence playing a central role in celebrated No-Go results such as Bell’s theorem. 
Abramsky and Vaananen have recently showed that, under a relational view on these results, some of 
these No-Go results can be logically formalized and syntactically derived using the axioms of indepen¬ 
dence and dependence atoms. For another application of team semantics in quantum information theory, 
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see fl3l . Similarly, in the foundations of social choice theory, there are results such as Arrow’s Theorem 
which can also be formalized in the team semantics setting. 

For the applications it is important to understand the complexity theoretic aspects of dependence logic 
and its variants. During the past few years, these aspects have been addressed in several studies. We will 
next briefly discuss some previous work. The data complexity of inclusion logic is sensitive to the choice 
between the two main variants of team semantics: under the so-called lax semantics it is equivalent 
to positive greatest fixed point logic (GFP + ) and captures PTIME over finite (ordered) structures {6l. 
On the other hand, under the strict semantics, inclusion logic is equivalent to ESO and hence captures 
NP El. The question whether there is a natural fragment of dependence logic capturing PTIME was 
recently considered in (3]j and a fragment D*-Horn satisfying D*-Horn = SO3-Horn = PTIME over 
finite successor structures was identified. 

In El the fragment of dependence logic allowing only sentences in which dependence atoms of arity 
at most k may appear (atoms =(x \,... . : x n ) satisfying n < k + 1) was shown to correspond to the k-ary 
fragment ESOy(k-ary) of ESO in which second-order quantification is restricted to at most k- ary func¬ 
tions and relations. Also, the fragment D(fcV) in which at most k variables are allowed to be universally 
quantified was related to a fragment ESO/(kV) of ESO consisting of Skolem normal form sentences 
with at most k universal first-order quantifiers. Similar results have been obtained for independence and 
inclusion logic (for the strict semantics) in lf5l[T2l. 

The combined complexity of the model-checking problem of dependence logic, and many of its 
variants, was recently shown to be NEXPTIME-complete Q. On the other hand, the satisfiability prob¬ 
lem for the two variable fragment of dependence logic was shown to be NEXPTIME-complete in lfT5l . 
Recently, this result has been generalized to cover many of the variants of dependence logic lfl6l . 

The starting point for the present work are the following results of fl4l showing that the non-classical 
interpretation of disjunction in team semantics makes the model-checking of certain quantifier-fee for¬ 
mulas very complicated. Define 0j and </b as follows: 

1. is the formula =(x,y) V =(u,v), and 

2. 02 is the formula =(x,y ) V =(u,v) V =(n,v). 

Surprisingly, the data complexity of the model-checking problem of 0 1 and 02 is already NL-complete 
and NP-complete, respectively. In fl4l it was also shown that model-checking for tp V ty where tp and 
ly are 2 -coherent quantifier-free formulas of D is always in NL. A formula (p is called ^-coherent if, 
for all 21 and X, 21 \=x 0, if and only if, 21 |=y 0 for all Y C X such that |F| = k. Note that the left-to- 
right implication is always true due to the downwards closure property of dependence logic formulas. 
The downwards closure property also implies that, for dependence logic formulas, the strict and the lax 
semantics are equivalent. For independence and inclusion logic formulas this is not the case. 

In this article our goal is to give as complete picture as possible of the tractability frontier of data 
complexity of model-checking of formulas of dependence, independence, and inclusion logic under the 
lax team semantics. In order to state our results, we define a new syntactic measure called the disjunction- 
width d v («jO) of a formula (p. Our results show that, for quantifier-free formulas (p of dependence logic, 
the data complexity of model-checking is in NL if d v («p) < 2. Surprisingly, for independence logic 
the case of quantifier-free formulas turns out to be more fine grained. We give a complete charac¬ 
terization also in this case and, in particular, exhibit a quantifier-free formula with d v (<p) < 2 whose 
data-complexity is NP-complete and a seemingly maximal fragment in NL. For quantified formulas, the 
complexity is shown to be NP-complete already with simple formulas constructed in terms of existential 
quantification and conjunction in the empty non-logical vocabulary. 
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For inclusion logic, we show that model-checking can be reduced to the satisfiability problem of 
dual-Horn propositional formulas. While interesting in its own right, this also provides an alternative 
proof of the recent result of (61 showing that the data complexity of inclusion logic is in PTIME, and is 
also analogous to the classical result of Gradel on certain Horn fragments of second-order logic ®. 

2 Preliminaries 

In this section we briefly discuss the basic definitions and results needed in this article. 

Definition 1. Let 21 be a structure with domain A, and V = {xi,... ,Xk } be a finite (possibly empty) set 
of variables. 

• A team X of 21 with domain Dom(X) = V is a finite set of assignments s : V —>• A. 

• For a tuple x = (xi,... ,x n ), where x,- E V, X(x) := {s(x) : s E X} is the n-ary relation of A, where 
s(x) := (s(x!),...,s(x n )). 

• For W C V, X \ W denotes the team obtained by restricting all assignments of X to VV'. 

• The set of free variables of a formula <p is defined as in first-order logic and is denoted by Fr(0). 

We are now ready to define team semantics. As now customary, we will restrict attention to formulas 
in negation normal form, and use the Lax semantics introduced in 0] that differs slightly from the 
semantics defined in ifTTll . Below 21 |= v a refers to the satisfaction in first-order logic, and s(m/x ) is the 
assignment such that s(m jx) (x) = m, and s(m/x) (y) = s(y) for y y^x. The power set of a set A is denoted 
by 0>(A). 

Definition 2. Let 21 be a structure, X be a team of A, and (p be a first-order formula such that Fr(«p) C 
Dorn(X). 

lit: For a first-order literal a, 21 \=x a if and only if, for all s E X, 21 \= s a. 

V: 21 \=x ys V 9 if and only if, there are Y and Z such that Y U Z = X, 211=y y/ and 211 =z 9. 

A: 21 \=x yt A 6 if and only if, 21 \=x ys and 21 \=x 9. 

3: 21 j=x 3xy/ if and only if, there exists a function F : X —> ,'Z (A ) \ {0} such that 21 \=x(f/x) ¥■ where 

X{F/x) = {s(m/x) : 5 €X,m G F(s)}. 

V: 2t \=x Vxi/r if and only if, 211 =x(a/x) ¥> where X (A/x) = { s(m/x ) :s €X,m€ A}. 

A sentence (j) is true in 21 (abbreviated 211= (j)) if 21 |=i w i (j). Sentences (j) and qY are equivalent, (j) = (j)', 

if for all models 21, 211= (j) 4A 2t \= (p'. 

First-order formulas satisfy what is known as the Flatness property: 21 \=x 0, if and only if, 211 = s (j) 
for all s € X. Next we will give the semantic clauses for the new dependency atoms: 

Definition 3. • Let x be a tuple of variables and let y be another variable. Then =(x,y) is a depen¬ 

dence atom, with the semantic rule 

21 \=x =(x,y) if and only if for all s,s’ € X, if s(x) = /(x), then s(y) = s l (y)\ 

• Let x, y, and z be tuples of variables (not necessarily of the same length). Then x _Ly z is a 
conditional independence atom, with the semantic rule 

21 |=x x A.y z. if and only if for all s,s' € X such that .v(y) = s'(y), there exists a s” G X such that 
s"(xyz) = s(xy)s\z). 
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Furthermore, when z is empty, we write ,v_Ly as a shorthand for x±~y, and call it a pure indepen¬ 
dence atom ; 

• Let x and y be two tuples of variables of the same length. Then x C y is an inclusion atom, with 
the semantic rule 

21 \=x x C y if and only if for all s € X there exists a s' £ X such that s' (y) = s(x). 

The formulas of dependence logic, D, are obtained by extending the syntax of FO by dependence 
atoms. The semantics of D-formulas is obtained by extending Definition 2 by the semantic rule defined 
above for dependence atoms. Independence logic, FO(_L c ), and inclusion logic, FO(C), are defined 
analogously using independence and inclusion atoms, respectively. 

It is easy to see that the flatness property is lost immediately when FO is extended by any of the 
above dependency atoms. On the other hand, it is straightforward to check that all D-formulas satisfy 
the following strong Downwards Closure property: if 21 \=x 0 and Y C X, then 21 \=y 0. Another basic 
property shared by all of the above logics is called Locality. 21 \=x 0, if and only if, 21 1=a" [F-r(p; 0- 

In this article we study the data complexity of model-checking of dependence, independence, and 
inclusion logic formulas. In other words, for a fixed formula (p of one of the aforementioned logics, we 
study the complexity of the following model-checking problem: given a model 21 and a team X, decide 
whether 21 \=x 0 ■ 

We assume that the reader is familiar with the basics of complexity theory. 


3 Dependence and independence logics 

In this section we consider the complexity of model-checking for quantifier-free and quantified formulas 
of dependence and independence logic. 

3.1 The case of quantifier-free formulas 

In this section we consider the complexity of model-checking for quantifier-free formulas of dependence 
and independence logic. For dependence logic the problem has already been essentially settled in lfl4l . 
The following theorems delineate a clear bander between tractability and intractability for quantifier-free 
dependence logic formulas. 

Theorem 4 (HU). The model checking problem for formula =(x,y) V ={z. v) is NT-complete. More 
generally, the model-checking for (p V >(/ where tp and y/ are 2-coherent quantifier-free formulas of D is 
always in NL. 

When two disjunctions can be used, the model checking problem becomes intractable as shown by 
the following results. 

Theorem 5 (HU). The model checking problem for formula ={x,y) V =(z,v) V =(z,v) is NP -complete. 

In order to give a syntactic analogue of TheoremlU we define next the disjunction-width of a formula. 
Definition 6. Let a be a relational signature. The disjunction-width of a (7-formula tp, denoted d v (<p), 
is defined as follows: 

1 if tp is y ±x Z or =(x,y) or x C y 
0 if (p is R(x) or -i R(x), for R € aU {=} 
max(d v (<Pi),d v ((C> 2 )) if (pis (pi A (p 2 
dv(<pi) + dv(<p 2 ) if (P is (Pi V<p 2 
d v (<Pi) if (p is 3x(pi or V<pi. 


d v (<P) = < 
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The next theorem is a syntactically defined analogue of Theorem 0] 

Proposition 7. The data complexity of model-checking of quantifier-free D-formulas 0 with d , ( 0) <2 
is in NL. 

Proof We will first show that a formula 0 with d v (0) = 1 is 2-coherent. This follows by induction using 
the following facts lfl4l : 

• dependence atoms are 2-coherent, and first-order formulas are 1-coherent, 

• if \(/ is k-coherent, then I//V0 is also ^-coherent assuming 0 is first-order, 

• if y/ is ^-coherent and 0 is k < /-coherent, then A 0 is /-coherent. 

It is also straightforward to check that the data complexity of a formula 0 with dy (0) = 1 is in Logspace 
(the formula 0 can be expressed in FO assuming the team X with domain x = Fr(</)) is represented by 
the n-ary relation X (x)). We will complete the proof using induction on (p with d v (0) = 2. Suppose that 
(p = y /V0, where d v (vO = d v (0) = 1- Then the claim follows by Theorem0] The case (p = t/r A0 is 
also clear. Suppose finally that (p = \\r V 0, where 0 is first-order. Note that by downward closure 

21 \=x <P 21 \=x> VL 

where X' = {5 E X \ 21 \f s 0}. Now since X' can be computed in Logspace, the model-checking problem 
of 0 can be decided in NL by the induction assumption for 1 //. □ 

In this section we examine potential analogues of Theorems 0] and [5] for independence logic. It is 
well-known that the dependence atom =(x ,y) is logically equivalent to the independence atom y _L y }’• 
Hence, the following is immediate from Theorem I5l lfl4l . 

Corollary 8. The model checking problem for formula 

y !* y V v l z v V v l z v 


is NP-com/;/<?/<?. 

For independence logic, the situation is not as clear, in particular concerning tractability. In the 
following we will exhibit a fragment of independence logic whose data complexity is in NL and which 
is in some sense the maximal such fragment. 

Definition 9. The Boolean closure of an independence atom by first-order formulas, denoted BC(_L,FO), 
is defined as follows: 

• Any independence atom x Ty 2 is in BC(_L,FO). 

• If <p € BC(±,FO), then for any formula 0 € FO, <p A 0 and (p V 0 are in BC(±,FO). 

Let (p G BC(_L,FO). Up to permutation of disjunction and conjunction, (p can be put into the follow¬ 
ing normalized form: 


(p = ((... ((* y A 0i) V Vi) A ...) A 0*) V y/ k 

Let 21 be any structure, £ + = fjLi 0/(21) and = ULi V 7 /(21), where, 0,(21) is the set of assign¬ 
ments s: Fr(«p) —> A such that 2t \= s 0,-. We can restate the fundamental property for satisfiability of an 
independence atom in a team (and a structure) to tackle the case of BC(_L,FO) formulas. It holds that, 
for any <p G BC(_L,FO), any team X and structure 21, 21 \=x ( P if and only if: 
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• for all 5 £ A: either 5 £ [((..(y/i A 02 ) V ...) A 0*) V y 4 ]( 2 l) or 5 £ <£+, 5 0 £ and 

• for all .si ,s 2 £ A' such that s 1 , S 2 £ CL, s ]. s 2 0 <£“, and .vi (z) = s 2 (z), there exists 53 £ A such that: 

S 3 (z) = si(z),s 3 (x) = s\(x) ands 3 (y) = s 2 (y). 

The first item is true by exhaustive case distinction. The second one comes from the fact that if a 
tuple s satisfies s £ £ + and ,v f £ then it is forced to be in the sub-team satisfying x _Lj y. 

In the rest of the paper, assignments si,S 2 as in the second item will be said compatible for formula 
(p and team A and 53 is called a witness of S], s 2 (for formula <p). 

Since checking whether a tuple s belongs to the query result 0(21) of a first-order formula can be 
done in logarithmic space, deciding whether 21 \=x (p is in Logspace. The following tractability result 
can be obtained. 

Theorem 10. The data complexity of the model checking problem for formulas of the form (p\ V (p 2 with 
(pi,(p 2 £ BC(_L,FO) is in NL. 

Proof The proof is given by a Logspace reduction to the satisfiability problem of 2-CNF formulas which 
is well-known to be in NL. Given a structure 21 and a team X we construct a 2-CNF propositional formula 
(p such that: 

21 \= x <Pi V 2 •<=>■ <E> is satisfiable. (1) 

Recall that if a team X is such that 21 \=x (pi V (p 2 then, there exists F, Z C A such that Y UZ = X 
and 21 |=y <pi and 21 \=z (p 2 - For each assignment s £ X, we introduce two Boolean variables Y f.v] and 
Z[,v]. Our Boolean formula <P will be defined below with these 2\X\ variables the set of which is denoted 
by Vai ( <T> ). It will express that the set of assignments must split into Y and Z but also make sure that 
incompatible assignments do not appear in the same subteam. 

For each pan - Sj,sj that are incompatible for (p 1 on team X, one adds the 2-clause: -iF[s,-] V -W[.v 7 ]. 
The conjunction of these clauses is denoted by Cy. Similarly, for each pair s^sj that are incompatible for 
(p 2 on team X, one adds the clause: ->Z Nv-z[ s j] and call Cz the conjunction of these clauses. 

Finally, the construction of <p is completed by adding the following conjunction: 

C= f\Y[s]XZ[s}. 

sex 

It is not hard to see, due to the remark on compatible pairs, that the formula = C A Cy A Cz can be 
built in Logspace. It remains to show that the equivalence dT|) holds. 

Assume that the left-hand side of the equivalence holds. Then, there exists F,ZC X such that Y U Z — 
X, 21 \=y (pi and 21 |= z (p 2 . We construct a propositional assignment 7 : Var(d>) —>{0,1} as follows. For 
all s £ Y, we set I(Y [ 5 ]) = 1 and for all s £ Z, we set similarly 7(Z[s]) = 1. 

Let us consider a clause -iF[.q] V — 'L7vy] for incompatible Si,Sj. Then, /(T[.>,■]) = 0 or / ( Y [.vy] ) = 0 
must hold. For a contradiction, suppose that 7(7[s,-]) = KY'sf) = 1. Then since 21 |=y (p 1 holds, by 
construction Sj and sj must be compatible for <pi. Hence we get a contradiction and may conclude that I 
satisfies -iTfs,-] V -^Y'sf. The situation is similar for each clause ->Z\sj\ V -<Z\sj\. Finally since X = YUZ, 
I also satisfies C. 

Let us then assume that 4> is satisfiable, and let I: Var(<I>) ^ {(). I} he a satisfying assignment for 4>. 
Since 1 1= C, we get that 7(T[s]) = 1 or I(Z[s\) = 1 for all s £ X. Let 

X ¥ = {s: 7(y[s]) = 1} and A z = {s : I{Z[s}) = 1}. 

Note that Xy U Xz = X. We will next show how the sets Xy and Xz can be extended to sets Y and Z 
such that 21 [=y (pi and 21 |=z (pi- Note first that, since I satisfies <F. for all ,v 1 ,.s ’2 £ Ay, <P cannot have 
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a clause of the form -i7[5i] V -i7 [ 52 ], and hence .v 1,52 arc compatible for (p[. Analogously we see that 
all si,S 2 £ Xz arc compatible for (p 2 - We will define the sets Y and Z incrementally by first initializing 
them to Xy and Xz, respectively. Note that even if XyUXz = X, no decision has been made regarding the 
membership of assignments 5 in Y (resp. Z) such that /( Y [ 5 ]) = 0 (resp. I(Z[s\) = 0). Let us first consider 
Y. Until no changes occur, we consider all pairs .V] ,S 2 £ Y such that .s | (z) = S 2 (z) and add into Y (if they 
arc not already in) all tuples 53 £ X such that 53 is a witness for the pair (5 1 , S 2 ) regarding property (pi. 
Since by construction 5i ,52 are compatible then at least one such 53 exists (but may be out of Y till now). 
We prove below that this strategy is safe. First of all, it is easily seen that any pair among { 51 , 52 , 53 } is 
compatible for (p\ . Therefore, it remains to show that the new assignments 53 arc compatible with every 
other element 5 added to Y so far. Suppose this is not the case and that there exists 5 £ Y \ { 51 , 52 } such 
that 53 and 5 arc incompatible for (py In passing one must have 53 (z) = s(z). Since 51 , 52 , and 5 arc in Y 
they are all pairwise compatible. Flence, there exists t\ such that t\ is a witness for the pair ( 51 , 5 ). 

Then, t\ (x) = 5| (x) = 53 ( 1 ), and t\(y) = s(y). Consequently, t\ is also a witness for 53,5 hence, 53 
and 5 are compatible which is a contradiction. Therefore, the assignment 53 can be safely added to Y. 
The set Z is defined analogously. By the construction, it holds that 211 =y (pi and 21 \=z qh. 

□ 

We will next show that a slight relaxation on the form of the input formula immediately yields 
intractability of model-checking. 

Theorem 11. The exists a formula (pi V (pi such that (pi £ BC(_L,FO) and <p 2 is the conjunction of two 
independence atoms whose model-checking problem is NP -complete. 

Proof Define (pi = w f \ A x A, y, (pi = ci _L C C 2 Ax _L Z y. We will reduce 3-SAT to the model¬ 
checking problem of (pi V (p 2 - Let = /\”=i Q be a 3-SAT instance. Each C, = V V p n with 
PhiPhiPh F {vi,...,v m ,-ivi,...,->v,„ }. To this instance we associate a universe 21 and a team X on 
the variables w,c,ci,C 2 ,z,x,y,t. The structure 21 is composed of m new elements ai,...,a m and of 
{ vi ,..., v m , —>vi,..., ->v m } U { 0,1}. For each clause Cj we add in X the 6 assignments displayed on 
the left below, and for each variable v,-, we add to X the 2 assignments on the right: 


w 

c 

Cl 

Cl 

z 

X 

y 

t 

0 

i 

1 

1 

h 

Ph 

Ph 

a 6i+ 1 

0 

i 

1 

1 

h 

Ph 

Ph 

^6i+2 

0 

i 

1 

1 

h 

Ph 

Ph 

a 6i+3 

1 

i 

0 

0 

0 

0 

0 

«6r+4 

1 

i 

1 

0 

0 

0 

0 

^6/+4 

1 

i 

0 

1 

0 

0 

0 

^6i+4 


w 

c 

Cl 

cz 

z 

X 

y 

t 

~0~ 

0 

0 

0 

i 

Vi 

Vi 

a 6(n+\)+i 

0 

0 

0 

0 

i 

-'Vi 

-iVi 

a 6(n+\)+i 


We will next show that is satisfiable if and only if 21 \=x <j>- 

=^- Suppose there is an assignment I: {vi,.., ,v m } — > {0, 1} that evaluates <F to true, i.e., at least one 
literal in each clause is evaluated to 1. We have to split X into two sub-teams X = Y U Z such 
that 21 \=y (w ^ 1 Ax _L f y) and 21 |=z (ci J_ f ci Ax T z y). We must put every assignment 5 £ X 
such that 5(w) = 1 in Z. There are exactly three such assignments per clause. We put in Z every 
assignment 5 such that s(x) = v, if 7(v,) = 1, and s(x) = -1 v,- if 7(v,-) = 0. The other assignments 
are put into Y. 

For each clause Ci, one literal p h ,p n , /?/, is assigned to 1 by 7. Then there is at least one as¬ 
signment 5(c,ci,ci) = (i,l,l) in Z. In Z, the assignments mapping c to i map (c,ci,co) to 
!),(/, 1,0), (/,0,1) or (i,0,0). Thus i=z c, c,. 
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If S\,S 2 G Z are such that sj(z) = s 2 (z) = i, then si(x) (analogously s 2 (x)) is v, if 7(v,-) = 1, —'V,- 
otherwise. Therefore, .sy (x) = s 2 (x) = ,V| (y) = s 2 (y), and hence 211 =z x _U y holds. 

As for Y, it is immediate that 21 \=y w / 1. The only pair of assignments s\,s 2 in X such that 
si(t) = s 2 (t) are (0,0,0,0, i,Vi,v;,a*) and (0,0,0,0,/,-iv,-,-'V i -,a^). Only one of them is in Y (si if 
7(vj) = 1, s 2 otherwise). Thus 21 1 =y x _L, y. 

<t= Suppose then thatX = Y UZ such that 21 \=y (w 1 Ax T, y) and 21 \=z (c\ _L C c 2 Ax _L Z y). Define 
an assignment 7 of the variables of <3> by: 7(v;) = 1 if s\ := (0,0,0,0,/,v ( -,v;,ajt) is in Z, 7(v,-) = 0 
if sf := (0,0,0,0,/, —>v/, —'V/,«*) is in Z. Since 21 \=zx _Uy, sf(z) = s\(z) and because there is no 
s' G X such that s' (x) = sf (x) , s' (y ) =s\(y) and s'(z) = s{(z) , for each i at most one of sf, s\ can be 
in Z. Similarly, because 21 1 =y x _L f y, only one of them can be in Y . Thus 7 is indeed a function. 

Since 21 \=zx _U y and there is no assignment inX such that (x,y) i->- (v/, —'V/), every pair si,s 2 €Z 
such that si(z) = x 2 (z) = i must have the same value of x and y. Every assignment representing 
a clause in Z respects the choice of 7. Furthermore, since 21 \=z c\ T c c 2 and (w,c,ci,c 2 ) 
(1,7,0,0), (1,/, 1,0), (l,i, 0,1) are in Z, (1,/, 1,1) must be in Z, i.e., at least one assignment per 
clause is in Z. By the above we may conclude that 7 satisfies <t>: at least one literal per clause is 
evaluate to 1 by 7. 


□ 

Hardness result of Proposition [8] concerns conditional independence atoms. We prove an analog for 
the case of pure independence below. 

Theorem 12. The model checking problem is NP-complete for formula (j) of the form 

(j) = (x T y) V (x T y) V (x 1 y) Vx ^ y 

Proof Let G = (Vo.Eq) be a graph, 21 = \f, he a first order structure of the empty signature, and X be the 
team X = {(v,v) |v £ Vg}U{ (vi,v 2 ), (v 2 ,vi) | (vi,v 2 ) € Eg} ( we write an assignment s with s(x) = v\ 
and s(y) = v 2 succinctly as (vi,v 2 )). We are going to show that G has a 3-clique cover if and only if 

21 hx O. 

=y Suppose that G has a 3-clique cover, i.e., there exists Ci,C 2 ,C 3 three cliques such that Vq = Vci U 
Vc 2 U Vc 3 ■ We have to prove 21 \=x 0 ■ For 7 € { 1,2,3}, let 

Xj = {(v,v) |v G C/}U{(vi,v 2 ),(v 2 ,vi) |vi,v 2 G C,-} 


and X 4 = X \ (Xj U X 2 U X 3 ). 

Because it is a vertex cover, every assignment of the form (v, v) is contained in Xi UX 2 UX 3 and 
not inX 4 , i.e. 21 \=x 4 xf^y. 

Let i G {1,2,3} and s,s' G X, be two assignments. If s(x,y) = (v,v) and s'(x,y) = (v',v'), then 
y, v’ G C; and there exists two assignments s\,s 2 in X, such that 33 (x,y) = (v,v') and s 2 (x,y) = (v',v) 
by construction. Similarly if s(x,y) = (v, v) and s'(x,y ) = (vi,v 2 ), there exists inX; the assignments 
(v,v 2 ) and (vi,v) (even if vj = v or v 2 = v). Finally, if s(x,y) = (vi,v 2 ) and s'(x,y) = (v),v(), 
the assignments (vi ,V]), (v 2 ,v 2 ), (v , ] , v)), (v 2 , v 2 ) are in X, and so are (vi,!^), (v}^). The above 
implies that 21 j=x,- x J_ y. 
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<= Suppose that 21 \=x (x _L y) V {x _L y) V (x X y) \I x / y, then X =X\ UX 2 UX 3 UX 4 such that 21 \=x i 
x _L y for i £ {1,2,3 } and 21 \=x 4 x / v. For i £ {1,2,3} let C, be the graph whose vertices arc 
{ v | (v, v) € Xj } and edges are 

{(vi,v 2 ) | (vi,v 2 ) £Xj and (v 2 ,vi) GX,-}. 

Note that some C, can be empty but they form a vertex cover of G as no assignment (v, v) is in 
X 4 . If v, v' £ Ci then (v,v) and (v',v') arc in Xj. By independence, (v,v 7 ) and (v',v) arc also in 
Xj. Therefore the edge (v,v r ) is in C,: C, is a clique. Therefore, G is covered by the three disjoint 
cliques C\ =C l ,C ' 2 = C 2 /C, and C ' 3 = C 3 /(C, UC 2 ). 

□ 


3.2 The case of quantified formulas 

In this section we show that existential quantification even without disjunction makes the model checking 
problem hard for both dependence and independence logic. 

Theorem 13. There is a formula (p of dependence logic of empty non-logical vocabulary build with 3 
and A whose model-checking problem is NP -complete. 

Proof. Define the formula (j) as follows: 

0 = 3x=(x,ri,r 2 ,e,m) A=(vi,v 2 ,x). 

We will reduce the problem of determining whether a graph G with i f vertices is /i-colorablc to the 
model-checking problem of (j). This graph problem is easily seen to be NP-complete. 

Let G be a graph with n 2 vertices Vg = {«o, • • •, ot n 2 _ 1 }, 21 = {0,... ,n — 1} a first order structure of 
the empty signature and X = { 5 } | i £ {0 ,...,if — 1}, 0 < j < /'} be a team such that : 

• s/(vi) = [i/n\ and 5-(v 2 ) = i mod /;. In other words, s/(vi,v 2 ) is the decomposition of i in base 
n. 

• s '[ (n) = [,i/n\ and sj (r 2 ) = j mod n. In other words, s[ (r\ , r 2 ) is the decomposition of j in base 
n. 

• s\! ( m ) = 0 if i f j and s\{m) = 1. 

• s\,(e) = 1, if i' = i, or if there is an edge between a,- and a,/ with i' > i. Otherwise s\,(e) = 0. 

For example, for n = 2 and Eq = {(0,1); (1,2); (0,2); (2,3)}, we obtain the following team on the 
universe A = {0,1}: 


X 

Vi 

v 2 

n 

n 

m 

e 


0 

0 

0 

0 

1 

1 


0 

1 

0 

0 

0 

1 


0 

1 

0 

1 

1 

1 


1 

0 

0 

0 

0 

1 


1 

0 

0 

1 

0 

1 


1 

0 

1 

0 

1 

1 


1 

1 

0 

0 
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0 

1 

0 

0 


1 

1 

1 

0 

0 

1 


1 

1 

1 

1 

1 

1 
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G is n-colourable iff 21 \=x 0 

We are going to demonstrate that 21 \=x 0 if and only if G is n-colourable. 

First the left to right implication. Since 21 \=x 0 there exists a mapping F: X —> &(A) \ {0} such 
that 21 | =x(F/x) = ( x - r \ ■ri.e.ni) A = (vj ,v 2 ,x). By downwards closure, we may assume without loss of 
generality that F(s) is a singleton for all ,v G X. Since =(vi,v 2 ,x) holds, F induces a mapping F ': Vq — >A, 
by F'{oti) = s'- (x). If there is an edge between a, and a,->, i' > i, then sUe) = 1 = s\{e). Furthermore, 
4(ri,r 2 ) = s‘;(r i,r 2 ) = / but sj(in) = 0 and s\(m) = 1 Therefore, because the atom =(x.r\ .n.e.m) holds, 
we must have sUx) / sj (x) (and F'(cXj) f F'(af )) if there is an edge between a, and a,/. Thus F' is a 
colouring of G with |A| = n colours. 

Let us then consider the right to left implication. Let c : Vo —> {0,... ,n — 1} be an n colouring. We 
extend X to variable x with a new team X' such that sj (xj = c(a,j. The value of x depends only on /, 
which is encoded in (vi,v 2 ), i.e., 21 \=x' =(vi,v 2 ,x). 

Let s-,si be two assignments of X'. Suppose that sj(r\ ,r 2 .e) = si (r l ,r 2 ,e) but sj(m ) ^ si (m). In 

7 j' 

this case we must check that sj (x) is different from si (x) (because 21 \=x> =(x, r \, r 2 .e. m)). Now it holds 

•7 • •/ 

that j = j because s- (r t . r 2 ) = si (n,r 2 ). Furthermore, since sj(m) ^ si (m), either i = j or i = /. Let 
us suppose i = j. Because 1 = s\{e) = sj (e) = si (e) = s l -,{e), there is an edge between a,- and a? in G. 
Therefore c(z) / c(i') and sj (x) ^ sj, (x). □ 

By encoding dependence atoms in terms of conditional independence atoms we get the analogous 
results for free for independence logic. 

Corollary 14. There is a formula (p of independence logic of empty non-logical vocabulary build with 3 
and A whose model-checking problem is NP-complete. 

We end this section by noting that existential quantifiers cannot be replaced by universal quantifiers 
in the above theorems. 

Proposition 15. The model-checking problem for formulas of dependence or independence logic using 
only universal quantification and conjunction is in Logspace. 

Proof We first transform (p into prenex normal-form exactly as in first-order logic flTTTl . We may hence 
assume that tp has the form 

Vxi ... Vx„ A @i (xt j • • • j x n , y t ,..., y m ) i 

where 0, is either a first-order, dependence, or independence atom. Let 21 be a model, and X be a 
team of A with domain {x \,... ,x„,yi,... ,y m }. As in ifTTl . the formula /\ 0,-(xi,... ,x„,yi,... ,y m ) can be 
expressed by a first-order sentence y/ when the team X is represented by the n + m-ary relation X (x.y), 
that is, 

21 Nx A 0;(xi,...,x„,yi,...,y m ) <£> (2t,X(x,y)) 1= iff. 

Since X (x.y) is a first-order definable extension of X (y) it is clear that we can construct a FO-sentence 
yf such that 

21 K Vx/\9i(x,y) o (2t,X(y)) |= yf, 


holds for all structures 21 and teams X with domain {yi,... ,y m }- The claim follows from the fact that the 
data complexity of FO is in Logspace. □ 
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4 Inclusion Logic 

In this section we show that the model-checking problem of inclusion logic formulas can be reduced to 
the satisfiability problem of dual-Horn propositional formulas. A propositional formula in conjunctive 
normal form is called dual-Horn if each of its clauses contain at most one negative literal. 

For a team X,x = {x (1 ,...,x !n } C dom(X), and s G X, we denote by six) the restriction of s to the 
variables x,-,,...,x, n . In this section, a denotes a relational signature. 

Proposition 16. There exists an algorithm which, given (p G FO(C), a structure 21 over a, and a team X 
such that (p C dom(A), outputs a propositional formula V P in dual-Horn form such that: 21 \=x <p < t=^ 'P 
is satisfiable. Furthermore, when (p is fixed, the algorithm runs in logarithmic space in the size o/2l and 
X. 

Proof Let <p, 21,X be as above and ix = |dom(X)|. For any team X, we will consider the set X of propo¬ 
sitional variables X [ 5 ] for 5 € A rx . Stalling from c p , 21, and X we decompose step by step formula <p into 
subformulas (until reaching its atomic subformulas) and different teams Y , Z, ... and control the rela¬ 
tionships between the different teams by propositional dual-Horn formulas built over the propositional 

variables issued from X,Y,Z, . Let ST = {((p,X,iy)} and ^ = {X[s] : 5 € X} U {“Xfs] : s $.X}. The 

propositional formula l P is now constructed inductively as follows. 

As long as ST 0, we apply the following rule: Pick (<j p,X,r ) in ST and apply the following rules. 

• If (p is R(x) withX € 0 then: ST := S fi \{((p,X,r)} and ^ := c S’U{X[s] —>R(s(x)) : for all 5 G A 1 }. 
Clearly, it holds that 21 \=x R(x) iff /\s(x)£R -, X [> v ] is satisfiable. 

• If (p is x C y then: =5^ := S^\{((p,X,r)} and 

:= tfU{X[s] -X V.s , eA r ,y(y)=s(x) X [s ; ] : s € A r }. 

It holds that 21 \=x x C y iff f\ seA r(X [,v] —> Vi'gA'',s'(y)=i(I)^[' s ' , ]) i s satisfiable. 

• If <p is 3xy/, then: ST := (S^\{((p,X,r)}) U {(yr, Y,r + l)}and 

^ := tfU{X[s] -x W= M , aeA y M : * e A''}, 

where the F[s], s € A r+l arc new propositional variables (not used in ( T). If 21 \=x 3x\jf then, there 
exists a function F: X -x !'TT(A) \ {0}, such that 211 =x(F/x) ¥• I n other words, 211 =y y for some 
team Y defined by the solutions of the constraint AseA^M “A Vv=(s,a), ax a HA (which define a 
suitable function F). Conversely, if 21 |=y 1 // for a team Y as above defined from X, then clearly 
21 \= x 3xt/r. 

• If <p is Vxt/r, then: ST := (S^\{((p,X,r)}) U {(yr, Y,r + 1)} and 

^:=^U{X[^7[s']:ieAVeA r+1 s.t. s'(x) = six)}, 

where the F[s], s G A r+l arc new propositional valuables (not used in r S). The conclusion is similar 
as for the preceding case. 

• If (p is Yi A t j/ 2 then: ST := (S^\{((p,X,r)}) U {(t/ri,X,r), (\j/ 2 ,X,r)} and Z? is unchanged. By 
definition, 21 \=x <p iff 21 \=x Vfi A \[f 2 - 
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• If (p is y'l V y /2 then: j? := (^\{(<p,X,r)}) U {(yq ,Y,r), (ys 2 ,Z,r)} and 

tf:=tfU{X[s}^Y[s}\/Z[s}:s£A r }\j{Y[s} ->X[s],Z[.y] ->X[s] :sGA r } 

where again the Y [ 5 ] and Z[s], s £ A r are new propositional variables (not used in Z'). Here again, 
21 \=x tp if and only if 21 \=y yq and 21 \=z y /2 for some suitable Y and Z such that Y UZ = X which 
is exactly what is stated by the Boolean constraints. 

Observe that each new clause added to Z 7 during the process is of dual-Horn form, i.e., contains at 
most one negative literal. Observe also, that applied to some ( (p,X,r ), the algorithm above only adds 
triples TZ whose first component is a proper subformula of <p and eliminates (tp,X,r). When the formula 
(p is atomic, no new triple is added afterwards. Hence the algorithm will eventually terminate with 
= 0. Setting V P := Ace^C it can easily be proved by induction that: 21 \=x ( P iff 'F is satisfiable. 
Observe also that each clause in Z 7 can be constructed from X and 21 by simply running through their 
elements (using their index) hence in logarithmic space. □ 

Remark 1. The construction of Proposition [16] can be done in principle for any kind of atom: depen¬ 
dence, independence, exclusion, constancy etc. To illustrate this remark, one could translate in the above 
proof a dependence atom of the form =(x,y) by (using the notations of the proof): 

A hxtfv^xW). 

s s'EA r 

The additional clauses are of length two. A similar treatment can be done for independence atoms 
x J_y z. In the two cases however, the resulting formula is not in Dual-Horn form anymore and there is 
no way to do so (unless PTIME = NP). 

Since deciding the satisfiability of a propositional formula in dual-Horn form can be done in polyno¬ 
mial time we obtain the following already known corollary. 

Corollary 17. The data complexity of FO(C) is in PTIME. 


5 Conclusion 

We have studied the tractability/intractability frontier of data complexity of both quantifier-free and quan¬ 
tified dependence and independence logic formulas. Furthermore, we defined a novel translation of inclu¬ 
sion logic formulas into dual-Horn propositional formulas, and used it to show that the data-complexity 
of inclusion logic is in PTIME. It is an interesting open question whether the translation of Proposition 
16 can be generalized to hold for some interesting extensions of FO(C) by further dependency atoms. 
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